GeneRIF is a more comprehensive, current and computationally tractable source of gene-disease relationships than OMIM

نویسندگان

  • John D. Osborne
  • Simon Lin
  • Warren A. Kibbe
  • Maria I. Danila
  • Rex L. Chisholm
چکیده

Motivation: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Methods: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to data mine gene-disease relationships from both the GeneRIF and OMIM databases. We utilized a comprehensive subset of UMLS structured as a directed acyclic graph (the Disease Ontology) to filter and interpret results from MMTx. The data mining methodology was validated against the Homayouni gene collection using recall and precision measurements. Results: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% (recall) and 98% (precision) using OMIM. Our thesaurusbased approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. Contact: [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The disease and gene annotations (DGA): an annotation resource for human disease

Disease and Gene Annotations database (DGA, http://dga.nubic.northwestern.edu) is a collaborative effort aiming to provide a comprehensive and integrative annotation of the human genes in disease network context by integrating computable controlled vocabulary of the Disease Ontology (DO version 3 revision 2510, which has 8043 inherited, developmental and acquired human diseases), NCBI Gene Refe...

متن کامل

Genetic Diagnosis of a Lethal Form of Autosomal Recessive Polycystic Kidney Disease

Background Autosomal recessive polycystic kidney disease (ARPKD; OMIM number 263200) is a severe early onset hereditary form of polycystic kidney and liver disease. Case Report In the current study, we present a consanguineous couple with a history of an affected son with polycystic kidney disease (PKD), hepatic failure and epileptic seizures who died at the age of 8 months. Both parents were h...

متن کامل

Sequence analysis of the VP1 gene in three very virulent Iranian infectious bursal disease virus strains

Infectious bursal disease (IBD) is a highly contagious disease of chickens caused by the infectious bursal disease virus (IBDV). This study was conducted to characterize three IBDV strains from Iran. A reverse transcriptase-polymerase chain reaction (RT-PCR) procedure was used to amplify a 715-bp fragment of the VP1 gene from IBDV strains. Amplified VP1 fragments of the three Iranian IBDV strai...

متن کامل

Finding GeneRIFs via Gene Ontology Annotations

A Gene Reference Into Function (GeneRIF) is a concise phrase describing a function of a gene in the Entrez Gene database. Applying techniques from the area of natural language processing known as automatic summarization, it is possible to link the Entrez Gene database, the Gene Ontology, and the biomedical literature. A system was implemented that automatically suggests a sentence from a PubMed...

متن کامل

A Framework for Annotating Human Genome in Disease Context

Identification of gene-disease association is crucial to understanding disease mechanism. A rapid increase in biomedical literatures, led by advances of genome-scale technologies, poses challenge for manually-curated-based annotation databases to characterize gene-disease associations effectively and timely. We propose an automatic method-The Disease Ontology Annotation Framework (DOAF) to prov...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007